Conversation

@quic-indrbane

…code

Reproduction steps:
transformers == 4.55.0

Download the alpaca dataset.

Run the command:
python -m QEfficient.cloud.finetune --device qaic --model_name \
  meta-llama/Llama-3.1-8B --tokenizer_name meta-llama/Llama-3.1-8B \
  --max_train_step 20 --output_dir \
  meta-llama/Llama-3.1-8B_alpaca_dataset_run --dataset alpaca_dataset \
  --run_validation False

Expected error:
TypeError: PreTrainedTokenizerFast._batch_encode_plus() got an unexpected keyword argument 'pad_to_max_length'

Signed-off-by: Indrajit Banerjee <indrbane@qti.qualcomm.com>
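
For context, transformers removed the long-deprecated pad_to_max_length tokenizer argument, so any dataset preprocessing that still passes it fails on 4.55.0; the accepted replacement is the padding argument. A minimal sketch of the kind of change involved (illustrative only, assuming a fixed-length padding setup; this is not the actual diff in this PR or in #599):

from transformers import AutoTokenizer

# Tokenizer name matches the reproduction command; the gated Llama
# checkpoint needs Hugging Face access, but any tokenizer shows the point.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B")

prompt = "Below is an instruction that describes a task."  # sample text

# Old call, rejected by transformers 4.55.0:
#   tokenizer(prompt, pad_to_max_length=True, max_length=512)

# Current equivalent: pad/truncate to a fixed length via `padding`.
tokens = tokenizer(
    prompt,
    padding="max_length",  # pad every sample up to max_length
    truncation=True,       # truncate samples longer than max_length
    max_length=512,
)
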
@quic-akuruvil (Contributor) left a comment

Hi @quic-indrbane, PR #599 already makes the same fix. We will merge that one on priority to unblock the fine-tuning activities.

@quic-rishinr (Contributor)

Can we close this PR, since the changes were added as part of #599?
